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EPO- Munich 

DESCRIPTION 24 

29. Juni 2002 

A Feedback process to model CP Utilization 
Multi-Processor Performance Models 

1. BACKGROUND OF THE INVENTION 

1.1. FIELD OF THE INVENTION 

The present invention relates to a method and system 
implementing feedback algorithms for controlling a given 
simulation model. In particular, it relates to a computer- 
program-based method and system for providing a feedback control 
for a given set of control quantities of a simulation model, 
comprising a plurality of iterated simulation runs in which a 
single simulation run consumes a considerable amount of time 

1.2. DESCRIPTION AND DISADVANTAGES OF PRIOR ART 

The present invention will be next defined from prior art on the 
specific field of computer system modeling. In particular, the 
invention will be illustrated by an exemplary application to a 
large scale multi-processor (MP) performance model with special 
focus on storage hierarchy (SH) performance. 

In the inventive context the term model refers to a time- 
dependent simulation model in contrast to an analytical spread 
sheet model: Simulation models are true cycle-by-cycle models of 
computer systems. In particular, queuing delays and resource 
utilizations are a result of simulated data transfer. Analytical 
models instead use mathematical formula to calculate queuing 
delays from given rates and utilizations. 

In large-scale compute environments, it is generally recommended 
that central processor (CP) utilization should not exceed 90%. 
Otherwise, excessive queuing delays for shared system resources 
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can severely impact system performance. Therefore, performance 
benchmarks used to test real hardware are also usually performed 
at the same CP utilization of 90%. 

Consequently, simulation models of real hardware must be able to 
accurately simulate any given CP utilization. When a multi- 
processor model ignores CP utilization and hence runs at a 
utilization of 100% instead of 90%, the load on the storage 
hierarchy cooperating closely with said CPs is significantly 
overstated and not representative for real customer 
environments. Sample model runs ignoring CP utilization turned 
out to mispredict total system capacity by as much as 7.5%. A 
misprediction of that magnitude can lead to wrong and hence 
costly design decisions and - even worse - to a wrong 
positioning in the marketplace. 

The difficulty to correctly model a given CP utilization results 
from the fact that CP busy time and hence CP utilization 
includes both pure instruction processing time and the time in 
which the CP is waiting for storage hierarchy (SH) requests to 
be resolved. The wait time in turn depends on the load of the SH 
and thus indirectly on CP utilization. As this functional 
relationship is unknown, prior art computer simulation models 
often ignore CP utilization completely and operate constantly at 
a utilization of 100% - with the exposures and consequences as 
mentioned above. 

The inventive approach presented here applies both to computer 
system models driven by instruction traces and driven by event 
rates . 

In models driven by instruction traces, the simulated CPs read, 
interpret and execute instruction sequences. In this case, CP 
utilization can be simulated by suspending instruction 
processing for a while at adequate points in the instruction 
stream - that is by artificially inserting periods of user think 
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time. Event driven models instead have drivers implemented which 
statistically generate requests to the storage hierarchy. 
Typical requests are Ll cache misses. Their frequency is an 
input parameter to the model and CP utilization is reflected in 
the interarrival time of events which is the average number of 
processor cycles between two events: The higher the CP 
utilization, the more requests are issued per unit of time. 

In both model types, the leverage to manipulate a CP's 
utilization is its relative utilization during the time in which 
it is not waiting for a SH request to be resolved. This time is 
referred to herein as the CP's entry utilization x . Then, the 
total utilization U is a workload dependent and system dependent 
function of x , 0=M*). As mentioned above, t* includes CP wait times 
and hence is unknown, and the underlying utilization modeling 
problem consists in choosing * such that p^mG*) converges against 
the utilization aimed at. This utilization quantity is referred 
to herein as the target utilization u . 

As mentioned before, prior art computer simulation models often 
ignore CP utilization completely. If, however, such prior art 
involves the predetermined setting of CP utilization, the most 
obvious approach to make f* converge against u is the so-called 
regulation technique . 

Said prior art regulation techniques use the following iterative 
approach which consists of 

a) defining a start value for the entry control quantity, 

b) performing a current run of simulation with this start value, 
thus yielding a resulting value for the target control quantity, 

c) comparing the resulting target quantity to its value aimed 
at, 

d) defining a new starting value in dependence of the existing 
result, and 
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e) performing a next run of simulation with this new starting 
value . 

Of course, instead of being 1 -dimensional , the starting value as 
well as the control quantity can be a multi-dimensional vector 
reflecting the multiple dimensions in a simulation system. 

In prior art, the algorithm to choose the entry utilization 
works as follows: Starting with some entry value X = A, the 
simulation model runs for a while, then determines the simulated 
total utilization "i and chooses a new value for x to be selected 
for the next iteration of the simulation run: If "i <", then x 2 >x \ 
is chosen and vice versa. *' denotes the instantaneous value of x 
valid for the next iteration. The quality and speed of 
convergence depend on the step-sizes chosen for the difference 
between A and A and on the initial value chosen for * and may 
even vary with the value of the target utilization In 
particular, this heuristical method is exposed to over-correct x 
and hence may lead to an undesired wide oscillation of the 
simulated utilization around M . 

Disadvantageous ly, prior art provided feedback control 
algorithms may even become complex, as soon as the convergence 
history is used to determine the starting value of x for the next 
iteration of the simulation run. Thus, the drawback of prior art 
is that 

a) it converges too slowly, which implies long-lasting 
simulation runs, 

b) it is exposed to an undesired oscillation around the target 
utilization . 

1.3. OBJECTIVES OF THE INVENTION 



It is thus an objective of the present invention to provide a 
method and system which 
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a) saves computational time in computer simulations working 

according to the above sequence principle 

b} avoids wide oscillation around the target value of 

utilization. 

2 . SUMMARY AND ADVANTAGES OF THE INVENTION 

This objective of the invention is achieved by the features 
stated in enclosed independent claims. Further advantageous 
arrangements and embodiments of the invention are set forth in 
the respective subclaims. Reference should now be made to the 
appended claims. 

In contrast to prior art algorithms as described above, the 
inventive algorithm uses one closed formula which was discovered 
within the present invention to choose the next instantaneous 
value *' for the entry control quantity *. This value applies 
advantageously unchanged to any target control quantity like 
the before-mentioned target utilization, implies excellent 
convergence of and implicitly regards the convergence history. 

Thus, according to its basic aspect, a computer-program-based 
method for providing a feedback control for a given set of entry 
and target control quantities * and u of a system model is 
disclosed, the method comprising a repetition of the following 
steps : 

a) providing a starting value A for each of the said entry 
control quantities x in the model, 

b) running the model based on said starting values and obtaining 
a resulting actual value for each of said target control 
quantities « 
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c) using the values obtained for u to define a new start value 
for x for use in a repeated modelling step, 



whereby the method is characterized by comprising the following 
formula to calculate the respective next value of the entry 
control quantities : 



X rvA 



^(H) (6a) 



where Pn is a suitable parameter and 



(6b) 



x 'n is valid for the next iteration only while "« and Pn are values 
measured from the beginning of the simulation. 

Instead of being given directly, the starting value or starting 
vector x may also be provided by derivation from other setting 
quantities * is dependent from, if such alternative setting means 
are available in the system. 

When said control quantities are CP utilizations in computer 
system models simulating multi-processor systems, then 
significant simulation time can be saved, and bring-up costs and 
bring-up time for new-developed computer hardware can be 
significantly reduced. 

Therefore, the inventive method and its cost and time savings 
are of significant interest to any computer manufacturer 
building large scale multi-processor systems. Beyond the field 
of computer system simulations, the inventive algorithm 
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(8) turns into (6), if is assumed to be sufficiently close to 

" and hence Pn+i close to Pn . 

Further, formula (5) suggests that the accumulated entry 
utilization at time T n+\ will be close to 

X » +1 = l+p n (l-u) (io) 
This is equivalent to the iterative formula 



Xn+1 ~ l+dn(l+PnXn) (11) 



where d n =(u n -u)/u. 

From (11), it follows that ^«+i^^and hence w„+i<«« if u n >u and 
vice versa. This guarantees that ^ u n) in fact converges against 
the target CP utilization u or at least oscillates sufficiently 
close around w . 



4.2 DETAILS OF PREFERRED EMBODIMENTS 

With general reference to the figures and with special reference 
now to fig. 1 the considerably large field of use for the 
inventive algorithm is intended to be illustrated. The actual , 
core of the present invention is the formula (6a, 6b) given 1 
above . 

This formula can be used according to the present invention in 
any technical application that uses a feedback mechanism to 
control a system of working units 10A, 10B, 10D which are 

switched in parallel and which access a common resource 14 . Each 
of said units may produce an output of material or immaterial 
nature. Examples are working pieces treated in such units 10, 
like for example a car in the car producing industry, or an 
electrical current as an example for an immaterial output. 
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Assuming Wj=Pi or p\ = 1 / the iteration starts with the entry- 
utilization x x -x\-u/{2-u) t where u is the target utilization. 

In a rate driven model, this determines the request rate during 
non-wait time in Ai = [T 0f Ti) . j n a model driven by instruction 
traces, it determines the user think time - ie the length of 
idle periods to be inserted into the instruction stream within 
A, . 

During A i , the model records the simulated wait time W, = Wi, and 
at time T u P\=Pi and P\ = Wi/Pi are deduced from formulas (1) 
A -W, =/>; + /; and (3) x\=P\/(P\+l\). 

From- (2), u\={P\+W\)IT\ is calculated, and formula (6) is applied 
to arrive at the entry utilization x i to be used in A 2 = [ri,r 2 ]. 
During A 2/ the model records the simulated wait time ^2, and at 
time similarly like before at time T i> p 2 and hence P2 = P\ + ^2, 
W 2 = Wi + W 2 and P2 = W 2 /P 2 ar e deduced from formulas (1) A-W 2 =/> 2 + / 2 a nd 

(3 ) 4 = W 2 +/ 2 ). 

Then from (2), "2 = (^2 + W 2 )/T2 is calculated and the procedure can be 
iterated as often as required by any finish-criterion. 

The following reasoning is added in here, in order to make the 
formula easier to understand: 



The instantaneous version of formula (5) valid within A„ reads 

/ 

' Un 

X n / f /N 

1+Pn{l-Hn) (8) 

Since obviously 



u n = nu n -in - l)u n -\ 



(9) 
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In the following formulas, 

u denotes the target utilization finally aimed at, 

A n = [T n ~\,T n ]>w±th T„=nA, ± s the n-th interval of observation time, 

I'n> ^*and w 'n denote the idle, processing and wait time within A « , 

f«« ^«and w n the respective accumulative times up to T n , 

u 'n and x 'n denote the instantaneous total and entry utilization 

within A «, and 

u n and x n the respective accumulative utilizations up to T n . 

Relying on formula (5) , all information available at time is 
used to choose the next value for the entry utilization. Thus, 
the inventive key regulation formula reads: 



' V n 



X nhl 



1+pnQrVn) (6a) 



where Pn = W n /P n and 



Vn—in+^U—rUAn 



(6b) 



Since w « aims to approach the simple choice 



n+l \+p n (l-u) ( 7 ) 

also makes CP utilization converge against " - slower, however, 
than with formula (6a, 6b) . 

The inventive embodiment of the feedback process is disclosed as 
follows : 
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The following time and utilization variables are used: 



Total run time 7 is the sum of system idle time l , instruction 
processing- time p and system wait time w : 

+ (1) 

During P, the system is assumed to process instructions at 
infinite cache speed. w is the total time a CP is delayed 
waiting for a storage hierarchy (SH) request to be resolved. 
Then, total CP utilization at time 7 is 

u T = (P + W)/T y (2) 

and the "entry control quantitity" , ie, the CP's entry 
utilization - ie, its relative utilization during P - becomes 

x T = PKP + I). ( 3 ) 

Straightforward arithmetic shows 

_ (1+Pr)*r 
U T — 1+pjXT (4) 



Where Pt-WP, or equivalently 

XT ~~ l+piil-ur) m (5) 

The feedback-process implemented according to this preferred 
embodiment adjusts x every A cycles in such a way that the actual 
CP utilization approaches the target utilization aimed at, which 
is referred to as ''target control quantity". 
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3 . BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention is illustrated by way of example but is 
not limited by the shape of the figures of the drawings in 
which: 

Fig, 1 is a schematic block diagram representation of a 
system under observation and simulation, to which the inventive 
approach applies; 

Fig. 2 is a schematic block diagram representation 
illustrating the functioning of the inventive control mechanism 
when applied to a plurality of transportation or processing 
units the throughput of which is dependent on external 
parameters ; 

Fig. 3 is a schematic block diagram representation 

illustrating a multi-processor performance model which is 
controlled and improved by the inventive method; 

Fig. 4 is a schematic block diagram representation 

illustrating the system structure of the IBM z900 series as an 
example for a real large scale multi-processor system which can 
underlie the model structure shown in fig. 3; and 

Fig. 5 is a chart showing CP utilization over runtime 
illustrating the efficiency of the inventive feedback algorithm 
when compared to prior art control methods. 

4 . DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 

* 

4 . 1 INTRODUCTION OF THE FORMULA APPLIED TO COMPUTER SYSTEM 
MODELS 

The inventive method is described in more detail by way of 
example applied to computer system simulation models. 
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disclosed also applies to and is beneficial to any field of 
technical simulations where a single simulation run consumes a 
considerable amount of time which is desired to be shortened. 

However, its applicability is not restricted to simulation 
models . The inventive method may also be used to control any 
real production process, provided the process has the following 
characteristics : 

First, the long term utilization of the processing machine must 
not exceed a certain limit (maybe in order to avoid burning out 
of , temperature-sensitive parts). 

Second, every once and a while, the work piece must be taken out 
of the main production stream for side processing (such as for 
heating up) before main processing can continue. 

1 

Third, the duration of the side process is not predictable, such 
as^ when several main production lines compete for only one side 
processing facility, and, 

finally, the main processing system may not be turned off during 
side processing, because the work piece must be further 
processed immediately when it returns to the main production 
line. 

In that case, in order to keep the main processing system's long 
term utilization below the given limit, but as well near to that 
limit, the system must be turned off for a certain amount of 
time every once and a while. As a skilled person may appreciate, 
this situation is equivalent; to the CP utilization problem in an 
instruction driven multi-processor simulation model as described 
above. Examples in the above sense are steel and rolling mills, 
or in particular bio-chemical production processes. 
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Each of said units 10 is controlled by an associated driver unit 
12A, 12B, 12D. Without restricting the generality of the 

present invention, a single driver unit 12 may also control more 
than one of the working units 10. 

Each working unit 10 is connected to said common, shared 
resource 14. The individual workload capacity of each unit 
depends on the degree up to which said common shared resource 14 
is loaded. 

When said common resource 14 performs poorly, the throughput of 
the working units 10 can be impacted severely. The inventive 
algorithm is intended to control the activity or load factor of 
each of the working units 10 in such a way that the shared 
resource 14 is in an operational status which is basically non- 
blocking. The reason, why an overload of the common shared 
resource 14 may result in a blocking of this system component 
will, of course, vary according to the applied field of use. A 
central feature of the system is the interdependency between the 
working units: The throughput of each individual working unit 10 
depends on the load of the other working units 10. 

Fig. 2 illustrates a more general exploitation of the inventive 
control mechanism. It applies to any transportation, mailing or 
processing units whose throughput depends on parameters outside 
to the unit itself. Again, the respective unit under observation 
is depicted with reference sign 10. It is driven by some driver 
unit 12. Its throughput is limited by some throughput limitator 
14 which may vary from case to case. 

The inventive mechanism presented is used to keep the throughput 
of the working unit 10 within a given tolerance band of for 
example +~2% around a given required throughput of e.g. 90% of 
the theoretically maximum throughput. 
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Thus, the working units' throughput is the target control 
quantity u in the sense of the appended claims. The throughput is 
synonymously denoted with a more generalised term of 
'utilization' of the working unit 10. The load imposed by the 
driver unit is the working unit's entry utilization *' . 

According to the generalized approach, a utilization measurement 
facility 20 measures the actual utilization "« of the working 
unit 10 by means of a predetermined scanning scheme and reports 
the measured utilization values to a control element 22 . 
Measurement and report frequency may vary according to any 
physical requirements present in the respective application 
case . 

The control element 22 stores a predetermined value of target 
utilization " associated with the working unit 10. The control - 
element 22 processes the measured actual utilization "» by means 
of the inventive formula (6a, 6b) and calculates a new entry 
utilization x n+i which the driver unit 12 uses to drive the 
working unit 10 until the next utilization measurement is 
performed. Thus, a closed loop control is implemented, involving 
the inventive formula. 

The loop connects two subsequent observation periods by 
providing a new start value for the entry utilization of the 
working unit 10. By means of the inventive formula, the new 
entry utilization is calculated from the current utilization of 
the working unit 10, which has just been measured. 

In fig. 2, only one working unit 10, driver unit 12 and 
utilization measurement facility 20 are depicted. It shall be 
understood, however, that multiple copies of these devices may 
co-exist. The input from each of the respective utilization 
measurement facilities 20 can than be fed into one common, 
single control element, which performs the above-mentioned 
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calculations and which in turn provides a new starting value to 
each of said driver units 12. Thus, as can be appreciated by a 
person skilled in the art, any prior art closed loop feedback 
control can take profit from the inventive algorithm. The 
advantage that results depends on the actual technical area in 
use. When, for example, the system is a complex simulation 
model, the technical effect associated with the present 
invention is to save computation time. When, however, the 
technical effect produced by the present invention is to provide 
for a very fast converging behavior, i.e. when by virtue of the 
present invention, the target control quantity such as the 
throughput of the working unit can be controlled such that some 
required value is quickly reached in practice, then, any 
individual physical technical effect and advantage can be 
achieved, the nature of which varies from the applicational 
field, again. When, for example the watermark of a system of 
rivers shall be controlled by the inventive approach, it is very 
important to avoid peaks of the control quantity (watermark) 
because both, a positive and a negative excess of the tolerated 
tolerance band may have severe effects on the ships being 
present on the rivers, or for inhabitants and infrastructure of 
the river ' s embedding landscape . 

Fig. 3 illustrates a preferred implementation of the inventive 
approach in a multi-processor performance model. As an example 
for a possibly underlying real computer system, fig. 4 depicts 
the structure of the IBM z900 series. 

In fig. 3, the system under observation is the box with 
reference sign 30. It corresponds to the total of fig. 1. Its 
major hardware components are a given number of processors, each 
including a level 1 cache (Ll-cache) , depicted with reference 
sign 32, a common shared level 2 cache 34 (L2-cache) , and a 
common shared main memory 36. 
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The processors 32 are both working units and unit drivers. They 
generate requests to the L2 -cache 34 and wait until the 
preceding request is complete before a new request is launched. 
In the schematic model depicted in fig. 3, a statistical number 
generator is used to determine the time at which the next 
request to the L2-cache 34 is issued. Alternatively, in trace 
driven models, a trace reader working on an instruction trace 
would be used instead of the statistical number generator. 

According to the preferred embodiment given in here, the time 
each processor 32 spends waiting for a request to complete is 
recorded in a table. Together with the processor's entry 
utilization, this wait time determines the actual load "r of each 
processor 32 at any time T. as detailed above, an accurate 
simulation of the CP utilization is key for a high quality 
performance model. 

The control elements of the model are shown outside box 30. 

The highest priority control element depicted with reference 
sign 38 is a module called service. When starting a simulation 
run, the service module 3 8 invokes an initialization routine 40 
which provides all parameters describing the system 
configuration and the workload to be modeled. In particular, the 
initialization routine reads all the request rates and the 
desired target utilizations of the processors from input files 
associated with the workload to be modeled. 

With regard to the target utilization w , a first entry 
utilization A is selected in a plausible way. From x \ and from 
the request rates read- in before, the mean time r = T i between any 
two requests is determined. Said mean time is referred to 
herein as in terarrival time. 
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Next, the service module 3 8 starts as many instances of the 
processor modules 32, as CPs are to be simulated. As mentioned 
above, the processor modules 32 generate storage hierarchy 
requests based on the actually valid interarrival time of 
requests and record their duration. 

During all simulation time, the service module 3 8 remains in a 
program loop, which invokes the statistics and the 
initialization routine in predetermined time intervals. The 
statistics routine corresponds to the utilization measurement 
device depicted with reference 20 in fig. 2. 

Upon each call, said statistics routine calculates the actual 
total utilization "« of the processors 32 from their entry 
utilization x 'n and the table-stored request wait time as 
described above in the introductive theoretical section 2.2. The 
initialization routine in turn uses the inventive formula (6a, 
6b) to calculate the entry utilization x n+i for the next 
observation period from M « and Finally, from the new value of 
the entry utilization, the mean interarrival time t = t«+i- is re- 
calculated and used from now on by the processing units 32. 

For improved clarity and as a supplemental inventive disclosure, 
the following section provides the pseudocode to the preferred 
embodiment of the invention illustrated by fig. 3 and fig. 4. 
For simplicity, the instantaneous entry utilization is denoted 
by y rather than *' : 

Service /* main controller */ 



{ 



Inits (virgin = TRUE) 

initialization* / 

Generate CP-Drivers 0, . . . ,m 



/* MP system */ 



/* initial 



Loop here for run__time 
{ 



Inits (virgin = FALSE) 

} 



Advance Delta cycles 



Stats (run^end = FALSE) 



/* observation period */ 
/* calculate utilization */ 
/* adjust entry util. */ 
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Stats (run_end = TRUE) 



/* final run statistics */ 



Inits (virgin) 
{ 

If virgin { 



/* Initializations */ 

/* on run start */ 



Read input parms 

specifically request rates r 
and aimed-at utilization u 

Make guess on entry _util y = y.l 

util */ 

} Else { 



/* initial entry 



/* each observ period */ 
/* use algorithm */ 
Calculate entry_util y = y . n = y (u . n, u) 
/* u.n from stats */ 



Calculate interarrival time Tau = Tau(y,r) of requests 



CP Driver 

{ 

Loop here for run_time 
{ 

Determine time to next request: 
t = Distribution (Tau) 

Advanc e t eye 1 e s 

Launch request 

Wait until request_done 

Record wait time w 



/* m = MP instances */ 



/* typically: */ 
/* Exponential distrib*/ 

/* to next request */ 



/* for stats */ 



Stats (run_end) /* Statistics */ 

{ 

If (not run_end) { /* each observ period */ 

/* w recorded by CP */ 
Calculate actual utilization u.n = u(w,y.n) 

/* y = entry util */ 
} Else { /* run complete */ 

Evaluate and report performance statistics 

} 

} 
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4.3 EFFICIENCY OF THE INVENTION COMPARED TO PRIOR ART 

Fig. 5 illustrates the efficiency and advantages of the 
inventive method by comparing its convergence behavior to prior 
art methods . 

The first prior art method is the stepwise approach, the curve 

of which is depicted with reference sign 51: The instantaneous 

entry utilization gets adjusted by a fixed value d , that is 

*n+i ~ x 'n +d, which changes its sign and gets divided by 2 as soon as 

the utilization crosses the utilization aimed at: d-+-d/2 m This is 

the approach with the poorest convergence among the alternatives 

shown. 

Better convergence is achieved with further approaches depicted 
by the graphs with reference signs 52 and 53 : Instead of the 
fixed value d used in method 51, they use the actual deviation 
from target utilization d n ~(u„-u)/u to adjust entry utilization. 
Two approaches are shown: 

The straightforward approach x 'n+i +4) is a second prior art 

method depicted by reference sign 52 . A choice with noticeably 
better convergence exploits the instantaneous version of formula 
(11) which reads x n+i = +<fn(l +Pn x n)) . This is an inventive 
embodiment of the general inventive formula (6a, 6b). Its graph 
is depicted with reference sign 53. 

By far best performing is the approach which directly uses the 
inventive formula (6a, 6b) . It is depicted with reference sign 
50. Excellent convergence is reached at a time where prior art 
curves still show unacceptable high fluctuation. 

The present invention can be realized in hardware, software, or 
a combination of hardware and software. A tool according to the 
present invention can be realized in a centralized fashion in 
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one computer system, or in a distributed fashion where different 
elements are spread across several interconnected computer 
systems. Any kind of computer system or other apparatus adapted 
for carrying out the methods described herein is suited. A 
typical combination of hardware and software could be a general 
purpose computer system with a computer program that, when being 
loaded and executed, controls the computer system such that it 
carries out the methods described herein. 

The present invention can also be embedded in a computer program 
product, which comprises all the features enabling the 
implementation of the methods described herein, and which - when 
loaded in a computer system - is able to carry out these 
methods . 

Computer program means or computer program in the present 
context mean any expression, in any language, code or notation, 
of a set of instructions intended to cause a system having an 
information processing capability to perform a particular 
function either directly or after either or both of 
the following 

a) conversion to another language, code or notation; 

b) reproduction in a different material form. 
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claims 29.Junl 2002 

A computer-program-based method for providing a feedback 
control for a given set of entry and target control 
quantities x and u of a system model, the method comprising 
a repetition of the following steps: 

a) providing a starting value A for each of said entry 
control quantities x in said model, 

b) 

running the model based on said starting values and 
obtaining a resulting actual value for each of said target 
control quantities 

c) using the values obtained for u to define a new start 
value for x for use in a repeated modeling step, 

whereby the method is characterized by comprising the 
following formula to calculate the respective next value of 
the entry control quantities: 



(6a) 



where Pn is a suitable parameter and 



v n =(n+i)u—riu n 



(6b) 



x 'n is valid for the next iteration only while and Pn are 
values measured from the beginning of the simulation. 



The method according to claim 1 in which said control 
quantities are CP utilizations in a computer system model 
simulating a multi-processor system. 
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A computer system having means for performing the 
steps of a method according to one of the preceding claims 
1 to 2. 

A computer program for execution in a data 
processing system comprising computer program code portions 
for performing respective steps of the method according to 
anyone of the preceding claims 1 to 2 , when said computer 
program code portions are executed on a computer. 

A computer program product stored on a computer 
usable medium comprising computer readable program means 
for causing a computer to perform the method of anyone of 
the claims 1 to 2 , 

when said computer program product is executed on a 
computer . 

A feedback control device having means for 
performing the steps of a method according to claim 1. 
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The present invention relates to feedback control algorithms for 
controlling a given simulation model. In particular, it relates 
to a computer-program-based method and respective system for 
providing a feedback control for a given set of control 
quantities of a simulation model, comprising a plurality of 
iterated simulation runs each of which consumes a considerable 
amount of time. Each single run is performed with a setting of 
starting values for certain entry control quantities suited to 
control certain target quantities. The respective next setting 
of entry quantities is based on the value of the target 
quantities measured in the preceding simulation run. In order to 
provide for a fast convergence of the target quantities to a 
final, predetermined value, the following formula is applied: 



l^H) (6 a) 



where 



v n =(n+l)u—riu ri 



(6b) 



" is the final value of the target control quantity aimed at, w « 
its value achieved up to iteration n of the simulation run and Pn 
is a suitable system parameter. x is a certain entry quantity 
suited to make M « converge against u , and *n+t is the value of * to 
be chosen for the next iteration. (Fig. 3) 
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